Efficient Behavior of Small- World Networks 
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We introduce the concept of efficiency of a network, measuring how efficiently it exchanges informa- 
tion. By using this simpie measure small-world networks are seen as systems that are both globally 
and focally efficient. This allows to give a clear physical meaning to the concept of small-world, and 
also to perform a precise quantitative analysis of both weighted and unweighted networks. We study 
neural networks and man-made communication and transportation systems and we show that the 
underlying general principle of their construction is in fact a small- world principle of high efficiency. 
PACS numbers 89.70.+C, 05.90. +m, 87.18. Sn, 89.40. +k 



We live in a world of networks. In fact any complex 
system in nature can be modeled as a network, where 
vertices are the elements of the system and edges repre- 
sent the interactions between them. Coupled biological 
and chemical systems, neural networks, social interact- 
ing species, computer networks or the Internet are only 
few of such examples Q. Characterizing the structural 
properties of the networks is then of fundamental impor- 
tance to understand the complex dynamics of these sys- 
tems. A recent paper Q has shown that the connection 
topology of some biological and social networks is nei- 
ther completely regular nor completely random. These 
networks, there named small-worlds, in analogy with the 
concept of small-world phenomenon developed 30 years 
ago in social psychology ||, are in fact highly clustered 
like regular lattices, yet having small characteristics path 
lengths like random graphs. The original paper has trig- 
gered a large interest in the study of the properties of 
small- worlds (see ref. [Q for a recent review) . Researchers 
have focused their attention on different aspects: study 
of the inset mechanism |]-[7j] , dynamics j8| and spreading 
of diseases on small- worlds , applications to social net- 
works Jl(]|ll|] and to the Internet |l2|,[l3| • In this letter we 
introduce the concept of efficiency of a network, measur- 
ing how efficiently information is exchanged over the net- 
work. By using efficiency small- world networks results as 
systems that are both globally and locally efficient. This 
formalization gives a clear physical meaning to the con- 
cept of small-world, and also allows a precise quantita- 
tive analysis of unweighted and weighted networks. We 
study several systems, like brains, communication and 
transportation networks, and show that the underlying 
general principle of their construction is in fact a small- 
world principle, provided attention is taken not to ignore 
an important observational property (closure). 
We start by reexamining the original formulation pro- 
posed in ref. There, a generic graph G with N 
vertices and K edges is considered. G is assumed to 



be unweighted, i.e. edges are all equal, sparse (K <C 
N(N— 1)/2), and connected, i.e. there exists at least one 
path connecting any couple of vertices with a finite num- 
ber of steps. G is therefore represented by simply giving 
the adjacency (or connection) matrix, i.e. the N ■ N ma- 
trix whose entry ay is 1 if there is an edge joining vertex 
i to vertex j, and otherwise. An important quantity of 
G is the degree of vertex i, i.e. the number ki of edges 
incident with vertex i (the number of neighbours of i). 
The average value of hi is k = 2K/N. Once {(%} is 
given it can be used to calculate the matrix of the short- 
est path lengths between two generic vertices i and j. 
The fact that G is assumed to be connected implies that 
dij is positive and finite Vi ^ j. In order to quantify the 
structural properties of G, B proposes to evaluate two 
different quantities: the characteristic path length L and 
the clustering coefficient C . L is the average distance be- 
tween two generic vertices L = w^rn Si^j ^ij j an d C 
is a local property defined as C — wYli^i- Here Cj is 
the number of edges existing in G;, the subgraph of the 
neighbors of i, divided by the maximum possible number 
ki(ki — l)/2. In H a simple method is considered to pro- 
duce a class of graphs with increasing randomness. The 
initial graph G is taken to be a one-dimensional lattice 
with each vertex connected to its k neighbours and with 
periodic boundary conditions. Rewiring each edge at ran- 
dom with probability p, G can be tuned in a continuous 
way from a regular lattice (p = 0) into a random graph 
{p = 1). For the regular lattice we expect L ~ N/2k and 
a high clustering coefficient C — 3/4(fe— 2)/(k — 1), while 
for a random graph L ~ lnN/ln(k — 1) and C ~ k/N 
Jl4],Pt . Although in the two limit cases a large C is asso- 
ciated to a large L and vice versa a small C to a small L, 
the numerical experiment reveals an intermediate regime 
at small p where the system is highly clustered like regu- 
lar lattices, yet having small characteristics path lengths 
like random graphs. This behavior is there called small- 
world and it is found to be a property of some social and 
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biological networks analyzed M . 

Now we propose a more general set-up to investigate 
real networks. We will show that: 

- the definition of small-world behavior can be given in 
terms of a single variable with a physical meaning, the 
efficiency E of the network. 

- 1/L and C can be seen as first approximations of E 
evaluated resp. on a global and on a local scale. 

- we can drop all the restrictions on the system, like un- 
weightedness, connectedness and sparseness. 

We represent a real network as a generic weighted (and 
possibly even non sparse and non connected) graph G. 
Such a graph needs two matrices to be described: the ad- 
jacency matrix {a^} defined as for the unweighted graph, 
and the matrix {iij} of physical distances. The number 
£ij can be the space distance between the two vertices or 
the strength of their possible interaction: we suppose iij 
to be known even if in the graph there is no edge between 
i and j. To make some examples, iij can be the geograph- 
ical distance between stations in transportation systems 
(in such a case iij respects the triangle equality, though 
this is not a necessary assumption), the time taken to ex- 
change a packet of information between routers in the In- 
ternet, or the inverse velocity of chemical reactions along 
a direct connection in a biological system. Of course, in 
the particular case of an unweighted graph iij = 1 Vi ^ j . 
The shortest path length dij between two generic points 
i and j is the smallest sum of the physical distances 
throughout all the possible paths in the graph from i 
to j. The matrix {dij} is therefore calculated by using 
the information contained both in matrix {(%} and in 
matrix {iij}. We have dij > iij Vi,j, the equality being 
valid when there is an edge between i and j. Let us now 
suppose that the system is parallel, i.e. every vertex sends 
information concurrently along the network, through its 
edges. The efficiency tij in the communication between 
vertex i and j can be then defined to be inversely propor- 
tional to the shortest distance: = 1/dij V«,j. When 
there is no path in the graph between i and j, dij = +00 
and consistently = 0. The average efficiency of G can 
be defined as: 

F(o = = 1 v — m 

1 ' N(N-l) N (N-l).^ G dij { > 

To normalize E we consider the ideal case Gid in which 
the graph G has all the N(N — l)/2 possible edges. In 
such a case the information is propagated in the most ef- 
ficient way since dij = iij and E assumes its maxi- 
mum value E(G id ) = JV(J ^_ 1) Z^jgo T~- Tne efficiency 
E(G) considered in the following of the paper is always 
divided by L(G id ) and therefore < E{G) < 1. Though 
the equality E = 1 is valid when there is an edge between 
each couple of vertices, real networks can reach a high 
value of E. 



In our formalism, we can define the small-world be- 
haviour by using the single measure E to analyze both 
the local and global behaviour, rather than two different 
variables L and C . The quantity in eq. (]l|) is the global 
efficiency of G and we therefore name it -Egiob- Since E 
is defined also for a disconnected graph we can character- 
ize the local properties of G by evaluating for each ver- 
tex i the efficiency of Gi, the subgraph of the neighbors 
of i. We define the local efficiency as the average effi- 
ciency of the local subgraphs, E\ oc = l/^V^ieG E{G{). 
This quantity plays a role similar to the clustering co- 
efficient C. Since i ^ Gi, the local efficiency E\ oc tells 
how much the system is fault tolerant, thus how efficient 
is the communication between the first neighbours of i 
when i is removed |l5|] . The definition of small- world can 
now be rephrased and generalized in terms of the infor- 
mation flow: small-world networks have high L g i b and 
-Eioc i.e. are very efficient in global and local communi- 
cation. This definition is valid both for unweighted and 
weighted graphs, and can also be applied to disconnected 
and/or non sparse graphs. 

It is interesting to see the correspondence between our 
measure and the quantities L and C of Q (or, corre- 
spondingly, 1/L and C). The fundamental difference is 
that -Lgiob is the efficiency of a parallel systems, where 
all the nodes in the network concurrently exchange pack- 
ets of information (such are all the systems in ||, for 
example), while 1/L measures the efficiency of a sequen- 
tial system (i.e. only one packet of information goes 
along the network). 1/L is a reasonable approximation 
of -Lgiob when there are not huge differences among the 
distances in the graph, and this can explain why L works 
reasonably well in the unweighted examples of Q. But, 
in general 1/L can significantly depart from Lgiob. For 
instance, in the Internet, having few computers with an 
extremely slow connection does not mean that the whole 
Internet diminishes by far its efficiency: in practice, the 
presence of such very slow computers goes unnoticed, be- 
cause the other thousands of computers are exchanging 
packets among them in a very efficient way. Here 1/L 
would give a number very close to zero (strictly in the 
particular case when a computer is disconnected from 
the others and L = +00), while -E g i b gives the correct 
efficiency measure of the Internet. We turn now our at- 
tention to the local properties of a network. C is only one 
among the many possible intuitive measures |]Io|| of how 
well connected a cluster is. It can be shown that when 
in a graph most of its local subgraphs Gi are not sparse, 
then C is a good approximation of E\ oc . Summing up 
there are not two different kinds of analysis to be done 
for the global and local scales, but just one with a very 
precise physical meaning: the efficiency in transporting 
information. 

We now illustrate the onset of the small-world in an un- 
weighted graph by means of the same example used in pi . 
A regular lattice with N — 1000 and k — 20 is rewired 
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with probability p and i? g i b and E\ oc are reported in fig.l 
as functions of p |Q . For p = we expect the system to 
be inefficient on a global scale (-Bgiob ~ k/N log(N/K)) 
but locally efficient. The situation is inverted for the ran- 
dom graph. In fact at p — 1 £giob assumes a maximum 
value of 0.4, meaning 40% the efficiency of the ideal graph 
with an edge between each couple of vertices. This at the 
expenses of the fault tolerance (E\ oc ~ 0). 
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FIG. 1. FIG.l Global and local efficiency for the graph 
example considered in A regular lattice with N = 1000 
and k — 20 is rewired with probability p. The small-world 
behaviour results from the increase of i? g iob caused by the 
introduction of only a few rewired edges (short cuts), which 
on the other side do not affect Ei oc . At p ~ 0.1, U g iob has 
almost reached the value of the random graph, though E\ oc 
has only diminished by very little from the value of 0.82 of 
the regular lattice. Small worlds have high E g \ h and Ei ac . 

The small-world behaviour appears for intermediate 
values of p. It results from the fast increase of -E g iob (for 
small p wc find a linear increase of -E g i b in the logarith- 
mic horizontal scale) caused by the introduction of only 
a few rewired edges (short cuts), which on the other side 
do not affect E\ oc . At p ~ 0.1, -E g i b has almost reached 
the maximum value of 0.4, though E\ oc has only dimin- 
ished by very little from the maximum value of 0.82. For 
an unweighted case the description in terms of network 
efficiency resembles the approximation given in S. In 
particular we have checked that a good agreement with 
curves L(p) and C(p) ^ can be obtained by reporting 
l/Egiobip) and E\ ac {p). Of course in such an example the 
short cuts connect at almost no cost vertices that would 
otherwise be much farther apart (because 1^ = 1 Vi 7^ j). 
On the other hand this is not true when we consider a 
weighted network. 

As real networks we consider first different examples of 
natural systems (neural networks), and then we turn our 
attention to man-made communication and transporta- 
tion systems. 

1 ) Neural Networks. Thanks to recent experiments 



neural structures can be studied at several levels of scale. 
Here we focus first on the analysis of the neuroanatom- 
ical structure of cerebral cortex, and then on a simple 
nervous system at the level of wiring between neurons. 
The anatomical connections between cortical areas are 
of particular importance for their intricate relationship 
with the functional connectivity of the cerebral cortex 
]l8[ . We analyze two databases of cortico-cortical con- 
nections in the macaque and in the cat fll9|]. Tab.l indi- 
cates the two networks are small- worlds |L6fl: they have 
high -E g iob, respectively 52% and 69% the efficiency of 
the ideal graph with an edge between each couple of ver- 
tices (just slightly smaller than the best possible values of 
57% and 70% obtained in random graphs) and high E\ oc , 
respectively 70% and 83%, i.e. high fault tolerance [f22| . 
These results indicate that in neural cortex each region is 
intermingled with the others and has grown following a 
perfect balance between local necessities (fault tolerance) 
and wide-scope interactions. Next we consider the neural 
network of C. elegans, the only case of a nervous system 
completely mapped at the level of neurons and chemical 
synapses J23j. Tab.l shows that this is also a small- world 
network: C. elegans achieves both a 50% of global and 
local efficiency. Moreover the value of -Egiob is similar to 
E\ oc . This is a difference from cortex databases where 
fault tolerance is slighty privileged with respect to global 
communication. 

2) Communication Networks. We have considered two 
of the most important large-scale communication net- 
works present nowadays: the World Wide Web and the 
Internet. Tab. 2 shows that they have relatively high val- 
ues of -©giob (slightly smaller than the best possible val- 
ues obtained for random graphs) and E\ oc . Despite the 
WWW is a virtual network and the Internet is a physi- 
cal network, at a global scale they transport information 
essentially in the same way (as their -Egiob's are almost 
equal). At a local scale, the bigger E\ QC in the WWW case 
can be explained both by the tendency in the WWW to 
create Web communities (where pages talking about the 
same subject tend to link to each other), and by the fact 
that many pages within the same site are often quickly 
connected to each other by some root or menu page. 

3) Transport Networks. Differently from previous 
databases the Boston subway transportation system 
(MBTA) can be better described by a weighted graph, 
the matrix {£ij} being given by the geographical dis- 
tances between stations. If we consider the MBTA as 
an unweighted graph we obtain that it is apparently nei- 
ther locally nor globally efficient (see Tab. 3). On the 
other hand, when we take into account the geographical 
distances, we have -E g i b = 0.63: this shows the MBTA is 
a very efficient transportation system on a global scale, 
only 37% less efficient than the ideal subway with a di- 
rect tunnel from each station to the others. Even in the 
weighted case E\ oc stays low (0.03), indicating a poor 
local behaviour: differently from a neural network the 
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MBTA is not fault tolerant and a damage in a station 
will dramatically affect the connection between the pre- 
vious and the next station. The difference with respect 
to neural networks comes from different needs and prior- 
ities in the construction and evolution mechanism: when 
we build a subway system, the priority is given to the 
achievement of global efficiency, and not to fault toler- 
ance. In fact a temporary problem in a station can be 
solved by other means: for example, walking, or taking a 
bus from the previous to the next station. That is to say, 
the MBTA is not a closed system: it can be considered, 
after all, as a subgraph of a wider transportation network. 
This property is of fundamental importance when we an- 
alyze a system: while global efficiency is without doubt 
the major characteristic, it is closure that somehow leads 
a system to have high local efficiency (without alterna- 
tives, there should be high fault-tolerance). The MBTA 
is not a closed system, and thus this explains why, unlike 
in the case of the brain, fault tolerance is not a critical 
issue. Indeed, if we increase the precision of the analysis 
and change the MBTA subway network by taking into 
account, for example, the Boston Bus System, this ex- 
tended transportation system comes back to be a small- 
world network (-Egiob = 0.72, E\ oc = 0.46). Qualitatively 
similar results, confirming the similarity of construction 
principles, have been obtained for other undergrounds 
and for a wider transportation system consisting of all 
the main airplane and highway connections throughout 
the world p5j. Considering all the transportation alter- 
natives available at that scale makes again the system 
closed (there are no other reasonable routing alterna- 
tives), and so fault-tolerance comes back as a leading 
construction principle. 

Summing up, the introduction of the efficiency mea- 
sure allows to give a definition of small- world with a clear 
physical meaning, and provides important hints on why 
the original formulas of [Q] work reasonably well in some 
cases, and where they fail. The efficiency measure al- 
lows a precise quantitative analysis of the information 
flow, and works both in the unweighted abstraction, and 
in the more realistic assumption of weighted networks. 
Finally, analysis of real data indicates that various ex- 
isting (neural, communication and transport) networks 
exhibit the small- world behaviour (even, in some cases, 
when their unweighted abstractions do not), substanti- 
ating the idea that the diffusion of small-world networks 
can be interpreted as the need to create networks that 
are both globally and locally efficient. 
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TABLE I. Macaque and cat cortico-cortical connections 
p9| . The macaque database contains N — 69 cortical areas 
and K — 413 connections p0| . The cat database has N = 55 
cortical areas (including hippocampus, amygdala, entorhinal 
cortex and subiculum) and K — 564 (revised database and 
cortical parcellation from ]2l]]). The nervous system of C. 
elegans consists of N — 282 neurons and K = 2462 links 
which can be either synaptic connections or gap junctions 
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-Bgiob -Bioc 




Macaque 
Cat 


0.52 0.70 
0.69 0.83 




C. elegans 


0.46 0.47 
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TABLE II. Commu nication networks. Data on 

the World Wide Web from jittp:/ /www. nd.edu/~networks 
contains TV = 325729 documents and K = 1090108 
links [p^|, while th e Internet database is taken from 
http://moat.nlanr.net and has TV = 6474 nodes and 



K = 12572 links. 





-Bglob 


-Bloc 




WWW 


0.28 


0.36 




Internet 


0.29 


0.26 





TABLE III. The Boston underground transportation 
system (MBTA) consists of TV = 124 stations and K = 124 
tunnels. The matrix {lij} of the spatial distances between 
stations, used for t he weighted case, has be en calculated us- 
ing databases from jittp: / /www. mbta.com/ and the U.S. Na- 



tional Mapping Division. 





-Eglob -Eloc 




MBTA (unweighted) 
MBTA (weighted) 


0.10 0.006 
0.63 0.03 





